Fibronectin binding protein as well as its preparation

ABSTRACT

The present invention relates to a new recombinant hybrid-DNA-molecule comprising a nucleotide sequence from S. aureus coding for a protein, or polypeptide, having fibronectin binding properties.

This application is a divisional of application Ser. No. 08/259,000, filed Jun. 13, 1994, now U.S. Pat. No. 5,571,514, which is a divisional of Ser. No. 08/007,817, filed Jan. 22, 1993, now U.S. Pat. No. 5,320,951, which is a continuation of Ser. No. 07/201,028, filed Jun. 1, 1988 now abandoned.

TECHNICAL FIELD

The present invention relates to a fibronectin binding protein as well as hybrid-DNA-molecules, e.g. plasmids or phages comprising a nucleotide sequence coding for said protein. Further the invention relates to microorganisms comprising said molecules and their use producing said protein, as well as the synthetic preparation of said protein.

The object of the present invention is to obtain a minimal fibronectin binding protein.

A further object is to obtain said protein by means of a genetic engineering technique by using e.g. a plasmid comprising a nucleotide sequence coding for said protein.

A further object is to obtain a possibility of preparing said protein by chemical synthesis.

Further objects will be apparent from the following description.

BACKGROUND OF THE INVENTION

WO-A1-85/05553 discloses bacterial cell surface proteins having fibronectin, fibrinogen, collagen, and/or laminin binding ability. Thereby it is shown that different bacteria have an ability to bind to fibronectin, fibrinogen, collagen, and/or laminin. It is further shown that fibronectin binding protein has a molecular weight of 165 kD and/or 87 kD, whereby it is probable that the smaller protein is a part of the larger one.

Fibronectin is a large glycoprotein (M_(r) ca 450 kd) with two similar subunits, which may vary in molecular size depending on a complex splicing pattern of a precursor mRNA (1). The major function of fibronectin, which is found in body fluids, blood clots and extracellular matrices, seems to be related to the ability of the protein to mediate substrate adhesion of most eukaryotic cells (2, 3, 4, 5.)

In the late seventies, Kuusela found that fibronectin not only interacts with eucaryotic cells but also binds to cells of Staphylococcus aureus (6). Since this observation, a number of pathogenic microorganisms have been shown to bind to fibronectin with a high degree of specificity and a high affinity (7). Fibronectin in the extracellular matrix appears to serve as a substratum also for the adhesion of different microorganisms. The binding of fibronectin may for some microorganisms represent a crucial step in the colonization of host tissue and development of infection.

Several different cell surface components have been implicated as fibronectin receptors on Gram-positive bacteria including lipotechioc acid (8, 9) and protein (10). In previous studies a fibronectin binding protein with a M_(r) of 197-210 kD has been isolated from S. aureus strain Newman (11, 12) and tentatively identified as a fibronectin receptor. To further characterize this fibronectin binding protein from S aureus, the gene for this protein has been cloned in E. coli. The fibronectin binding domain within this protein has also been localized and the behaviour of a fusin protein containing this domain and IgG-binding regions of protein A will be disclosed below.

DESCRIPTION OF THE INVENTION

It has now surprisingly been found possible to obtain a hybride-DNA-molecule comprising a nucleotide sequence coding for a protein or a polypeptide having fibronectin binding properties. As evident from below the following nucleotide sequence is present in the gene coding for said protein SEQ ID NO.:!:

    __________________________________________________________________________     GGC                                                                               CAA                                                                               AAT                                                                               AGC                                                                               GGT                                                                               AAC                                                                               CAG                                                                               TCA                                                                               TTC                                                                               GAG                                                                               GAA                                                                               GAC                                                                               ACA                                                                               GAA                                                                               GAA                                  GAC                                                                               AAA                                                                               CCT                                                                               AAA                                                                               TAT                                                                               GAA                                                                               CAA                                                                               GGT                                                                               GGC                                                                               AAT                                                                               ATC                                                                               GTA                                                                               GAT                                                                               ATC                                                                               GAT                                  TTT                                                                               GAT                                                                               AGT                                                                               GTA                                                                               CCT                                                                               CAA                                                                               ATT                                                                               CAT                                                                               GGT                                                                               CAA                                                                               AAT                                                                               AAA                                                                               GGT                                                                               AAT                                                                               CAG                                  TCA                                                                               TTC                                                                               GAG                                                                               GAA                                                                               GAT                                                                               ACA                                                                               GAA                                                                               AAA                                                                               GAC                                                                               AAA                                                                               CCT                                                                               AAG                                                                               TAT                                                                               GAA                                                                               CAT                                  GGC                                                                               GGT                                                                               AAC                                                                               ATC                                                                               ATT                                                                               GAT                                                                               ATC                                                                               GAC                                                                               TTC                                                                               GAC                                                                               AGT                                                                               GTG                                                                               CCA                                                                               CAT                                                                               ATT                                  CAC                                                                               GGA                                                                               TTC                                                                               AAT                                                                               AAG                                                                               CAC                                                                               ACT                                                                               GAA                                                                               ATT                                                                               ATT                                                                               GAA                                                                               GAA                                                                               GAT                                                                               ACA                                                                               AAT                                  AAA                                                                               GAT                                                                               AAA                                                                               CCA                                                                               AGT                                                                               TAT                                                                               CAA                                                                               TTC                                                                               GGT                                                                               GGA                                                                               CAC                                                                               AAT                                                                               AGT                                                                               GTT                                                                               GAC                                  TTT                                                                               GAA                                                                               GAA                                                                               GAT                                                                               ACA                                                                               CTT                                                                               CCA                                                                               AAA                                                                               GTA                                                                               AGC                                                                               GGC                                                                               CAA                                                                               AAT                                                                               GAA                                                                               GGT                                  CAA                                                                               CAA                                                                               AGC                                                                               ATT                                                                               GAA                                                                               GAA                                                                               GAT                                                                               ACA                                                                               ACA                                                                               CCT                                                                               CCA                                                                               ATC                                                                               GTG                                                                               CCA                                                                               CCA                                  ACG                                                                               CCA                                                                               CCG                                                                               ACA                                                                               CCA                                                                               GAA                                                                               GTA                                                                               CCA                                                                               AGT                                                                               GAG                                                                               CCG                                                                               GAA                                                                               ACA                                                                               CCA                                                                               ACG                                  CCA                                                                               CCA                                                                               ACA                                                                               CCA                                                                               GAA                                                                               GTA                                                                               CCA                                                                               AGT                                                                               GAG                                                                               CCG                                                                               GAA                                                                               ACA                                                                               CCA                                                                               ACA                                                                               CCA                                  CCG                                                                               ACA                                                                               CCA                                                                               GAA                                                                               GTG                                                                               CCG                                                                               AGT                                                                               GAG                                                                               CCA                                                                               GAA                                                                               ACT                                                                               CCA                                                                               ACA                                                                               CCG                                                                               CCA                                  ACA                                                                               CCA                                                                               GAG                                                                               GTA                                                                               CCA                                                                               GCT                                                             __________________________________________________________________________

The invention further comprises a plasmid or phage comprising a nucleotide sequence coding for said fibronectin binding protein.

The invention further comprises a microorganism comprising at least one hybrid-DNA-molecule according to the above.

The invention further comprises a method for producing a fibronectin binding protein whereby at least one hybrid-DNA-molecule of above is introduced into a microorganism, cultivating said microorganism in a growth medium, and isolating the protein thus formed by means of an affinity chromatography on a fibronectin bound to an insolubilized carrier followed by ion exchange chromatography.

A further aspect of the invention comprises a chemical synthesis of the fibronectin binding protein, whereby an amino acid sequence is built up based on said nucleotide sequence encoding for said protein starting from the C-terminal histidine which is stepwise reacted with the appropriate amino acid, whereby it is finally reacted with glycine at the N-terminal end, to form the fibronectin binding peptide region.

Appropriate carrier proteins can be coupled to the amino acid sequence as well, such as IgG binding regoins of protein A.

The invention will be described in the following with reference to the examples given, however, without being restricted thereto.

EXAMPLE 1 Screening of a Gene Bank for Fibronectin Binding Protein (FNBP)

A gene bank in plasmid pBR322 of chromosomal DNA from Staphylococcus aureus strain 8325-4 earlier described by (13) was screened for clones expressing FNBP. E. coli clones were lysed and the lysates were tested for their ability to inhibit the binding of ¹²⁵ I-fibronectin to cells of S. aureus Cowan I as described in the method section. To simplify screening the clones were pooled in lots of 25, lysed and tested. Out of the 22 pools tested, the one with the highest inhibitory activity was retested in 5 pools of 5. Finally, the individual clones in one positive pool were tested resulting in the isolation of one single positive clone. The plasmid in the positive clone was called pFR001.

This plasmid pFR001 in an E. coli strain 259 has been deposited at the Deutsche Sammlung von Mikroorganismen (DSM) and has thereby been allocated the deposition number 4124.

Isolation of the Staphylococcal FNBP from E. coli pFR001

The E.coli clone, positive for FNBP, was grown in LB-medium at 37° C. to the stationary growth phase. The bacterial cells were centrifuged and lysed by an osmotic shock procedure (14). To exclude that inhibitory effect of the shock lysate on the binding of ¹²⁵ I-fibronectin to S.aureus cells is due to proteolytic activity, lysate was added to the incubation mixture and the amount of ¹²⁵ I-fibronectin bound to S.aureus was determined after 1,2,3, and 4 hrs of incubation. Throughout this incubation period, the level of inhibition caused by the lysate remained constant (e.g. 50%) suggesting that the observed inhibition was not due to a progressive degradation of the ¹²⁵ I-labelled ligand or corresponding receptor.

If the inhibitory activity is due to the presence of FNBP-like structures in the lysate, these should express a specific affinity for fibronectin. The lysate, was therefore analysed by affinity chromatography on a column of fibronectin-Sepharose as described in Materials and Methods (FIG. 1A). The purification was about 30 fold. Further fractionation was achieved by subjecting the affinity purified material to ion-exchange chromatography using a mono Q column fitted on to a FPLC-system. In this fractionation step two major peaks were obtained (FIG. 1B). Analyses by polyacrylamide gel electrophoresis indicated that the two peaks contained proteins of molecular weights 165 and 87 kD, respectively (FIG. 2). Both components inhibited the binding of ¹²⁵ I-fibronectin to S.aureus. The fraction containing the 165 kD protein had a specific inhibitory activity of 220 units/qg representing a 430-fold purification from the original shock lysate and it was 30 times more active than the 87 kD protein on a molar basis (data not shown).

Amino acid analysis of the two proteins showed that they had a very similar amino acid composition which also closely resembled that determined for the native FNBP isolated from S.aureus strain Newman (Table 1). The immuological relationship of 165 kD protein isolated from E.coli pFR001 to the native FNBP isolated from S.aureus strain Newman was analyzed. The 165 kD protein was ¹²⁵ I-labelled and immunoprecipitated with increasing dilutions of an antibody to S. aureus strain Newman. A 300-fold dilution of the antiserum precipitatd 50% of the ¹²⁵ I-labelled protein. Unlabelled 165 and 87 kD proteins as well as the native FNBP interferred with the immunoprecipitation of the labelled 165 kD protein (data not shown). These observations suggest that the 165 and 87 kD proteins isolated from S.aureus strain Newman are closely related to fibronectin binding protein (FNBP).

Identification of the region of the fnbp-gene coding for the binding activity

The size of the insert in plasmid pFR001 is about 6.5 kbp. The restriction map of the insert is shown in FIG. 3(A). In order to determine the transcription direction and to localize the region coding for the binding function, the PstI fragment of about 3.7 kbp was recloned in the PstI site of pBR322. This results in loss of the ampicillin resistance phenotype conferred by this plasmid. Eight such clones were obtained, five of which were positive and three negative for fibronectin binding activity. One of each was tested for the orientation of the PstI fragment. The plasmid of the positive clone pFR004 had the EcoRI site closest to the Amp-promoter and that of the negative clone had the reverse orientation. These data indicated that the transcription orientation is from EcoRI to PstI on the 3.7 kbp EcoRI - PstI fragment and that at least a part of the fnbp-gene, which codes for the binding function, is located on this fragment. To verify this, the Amp-promoter in pFR004 was removed by EcoRI digestion followed by religation of the plasmid. The resulting plasmid pFR008 does not express fibronectin binding activity. In pFR001 the endogenous promoter is thus presumably located somewhere on the left hand side of the EcoRI site as indicated by the arrow in FIG. 3A). By a plasmid construction (not described in detail here) the EcoRI (in the insert)-SalI (in pBR322) fragment from pFR001 was introduced into pFR008. The fibronectin binding activity was then regained, due to the restoration of the endogenous promoter from the fnbp-gene.

Knowing the transcription direction of the fnbp-gene (from left to right as drawn in FIG. 3(A)) fusions were made to the gene for staphylococcal protein A. An expression/secretion vector called pRIT3 based on the protein A gene with restriction enzyme multilinker has been constructed (15). The multilinker is placed immediately downstream of the last IgG-binding region, thus eliminating the C-terminal cellwall binding region of protein A. The 3.7 kbp EcoRI-PstI fragment from pFR001 was inserted into this vector expecting it to encode a protein A-FNBP fusion protein, providing the reading frame was correct. This was obviously the case since the clone containing this plasmid, called pFR013, is positive in tests for both protein A and FNBP.

Plasmid pFR013 was treated with exonuclease Bal31 in order to identify a region smaller than the 3.7 kbp insert coding for the fibronectin binding activity. The plasmid was cleaved with EcoRI or Pst1 (at the 5' and the 3'end of the coding region, respectively), treated for various times with Bal31 and religated. During ligations a 20-fold excess of an EcoRI linker was added in oder to introduce EcoRI sites at new positions. FIG. 3(C) shows the deletions obtained from either the 3' or the 5' end and the corresponding fibronectin binding activity. A region of the gene of about 700 bp coding for binding activity is located between the end points of deletions number 56 (deletion from the 5 end) and number 22 (deletion from the 3 end). From deletion plasmid number 56, the 900 bp region from the newly introducted EcoRI site to the PvuII site was subcloned. This fragment was cloned into pUC18 cleaved with EcoRI and SmaI. The resulting plasmid, which encodes a fusion protein with both β-galactosidase and fibronectin binding activity, is called pFR015. The fragment can be recloned from pFR015 by EcoRI and BamHI cleavages because of the multilinker in pUC18.

Restriction fragments were also subcloned as indicated in FIG. 3(B). In the case of EcoRI-PstI and EcoRI-ClaI fragments the protein A expression vector pRIT3 was used. Fragments BalI-PvuII and BalI-HincII were subcloned and expressed in pUC18. In all cases, except for the EcoRI-ClaI fragment, fusion proteins with fibronectin binding activity were obtained. The negative result in the case of the ClaI-ClaI subclone may be due to the insert appearing in the wrong reading frame.

Production and Characterization of a Fusion Protein, ZZ-FR

The yield of the 165 kD protein (FIG. 2) from an osmotic shock lysate of E. coli HB101 cells carrying the plasmid pFR001 was approximately 40 μg per liter culture medium. Even if there were losses due to degradation of the high molecular weight compound during the purification all data indicated that the fnbp-gene with the endogenous promoter is weakly expressed in E. coli. In order to improve the Level of expression we used a recently developed expression system, which allows heterologous proteins to be secreted to the growth medium of E. coli (16). The plasmid vector used, pEZZ318, contains two syntetic, sligtly modified IgG-binding domains of the gene for staphylococcal protein A preceeded by the promoter and signal sequence of the same gene. A BalI-PvuII fragment of approximately 600 bp of the fnbp-gene (FIG. 3(B)) cloned in pUC18 was recloned into pEZZ318 cleaved with EcoRI and HindIII. After ligation and transformation pEZZ-FR was isolated. This plasmid encodes a fusion protein consisting of the IgG binding product ZZ and a fibronectin binding region FR of the FNBP.

For production of the ZZ-FR protein E. coli strain HB101 carrying the plasmid pEZZ-FR was grown overnight in Trypticase Soy Broth. Bacteria were removed by centrifugation and the growth medium was passed through an IgG-Sepharose Fast Flow column. After washing the column with TST-buffer (50 mM Tris-HCl pH 7.4 150 mM NaCl, 0,05% Tween® 20) the ZZ-FR protein was eluted with 0,5M acetic acid titrated to pH 2.8 using ammonium acetate. Approximately 50 mg protein was eluted from the column per liter of growth medium applied.

After lyophilization the eluted material was analyzed by SDS-PAGE, which revealed a major protein band at approximately 63 kD (FIG. 4). In addition, bands corresponding to smaller fragments appeared, probably due to proteolytic degradation of the fusion protein. The intact protein A which was run on the same gel appeared as a diffuse band around 56 kD. The proteins in the gel were electrophoretically transferred to a nitrocellulose paper and probed with a ¹²⁵ I-labelled 29 kD fibronectin fragment. FIG. 4, lanes C and D, shows that the fusion protein but not intact protein A, binds the radiolabelled fibronectin fragment.

Further evidence for the fibronectin binding ability of the ZZ-FR fusion protein was obtained by affinity chromatography on a Sepharose column substituted with the 29 kD fibronectin fragment. The fusion protein was bound to the column and was eluted from the affinity matrix with 6M GuHCl (FIG. 5A). The fibronectin binding activity of the ZZ-FR fusion protein was apparently located in the FR-region since intact protein A did not bind to the affinity matrix (FIG. 5B). The portion of the ZZ-FR fusion protein preparation that did not bind to the affinity matrix consisted of proteins with M_(r) lower than of the intact fusion protein (FIG. 6, lane A) whereas the material binding to the column consisted of an almost pure preparation of intact 63 kD ZZ-FR fusion protein (FIG. 6, lane B).

In oder to determine how much of the fibronectin binding capacity of staphylocaccal cells that can be ascribed to the cloned FNBP, the inhibitory activity of the fusion protein ZZ-FR was quantified. As shown in FIG. 7 the fusion protein totally inhibited the binding of ¹²⁵ I-labelled 29 kD fragment as well as intact fibronectin to cells of both S. aureus strains Newman and 8325-4, protein A, which was used as a control, did not inhibit the binding (FIG. 7).

After the report by Kuusela (6) that S. aureus binds to fibronectin much work has been focused on attempts to identify the bacterial component(s) responsible for the binding. The rationale for these studies has been that binding of pathogenic bacteria to fibronectin may represent a mechanism of tissue adherence of crucial importance in the early stages of an infection. Proteins purified by affinity chromatography on immobilized fibronectin, have been implicated in the binding of staphylococcal cells to fibronectin. However the reported molecular weights of the fibronectin binding proteins vary from 18 kD (10) all the way up to 197 and 210 kD (11, 17). The main reason for the heterogeniety in molecular size may be proteolytic degradation of the proteins during the isolation procedures.

In the present disclosure the cloning in E. coli of a gene coding for a fibronectin binding protein from S. aureus strain 8325-4 is disclosed. When the fnbp-gene is expressed in E. coli from the endogenous promoter the protein can be isolated from the periplasm by osmotic shock. This indicates that not only the promoter but also the signal peptide is functional in E. coli.

Although the proteins coded for by the cloned fnbp-gene have molecular weights of 165 and 87 kD (FIG. 2), which is smaller than the FNBP isolated from S. aureus strain Newman (M_(r) =210 kD) (12), their amino acid compositions closely resemble that of the native protein (Table 1). Furthermore, antibodies raised against the native FNBP cross-react with the 165 and 87 kD proteins. These data strongly suggest that the structure of the proteins coded for by the cloned gene from S. aureus strain 8325-4 resembles that of native FNBP from S. aureus strain Newman. The 87 kD protein may be the result of pretermination at the transcriptional or translational level or alternatively proteolytic cleavage of the 165 kD protein. At present it cannot be explained why the 165 kD protein is as much as 30 times more active than the 87 kD protein in inhibition of fibronectin binding to S. aureus cells.

By deletion mapping using Bal31-clevage and subcloning of restriction fragments the domain of the fnbp-gene encoding the fibronectin binding activity had been located to a region of approximately 350 bp (FIG. 3(B)). A fragment of the gene of approximately 600 bp covering these 350 bp was ligated directly to a tandemly repeated sequence (zz) of a synthetic IgG-binding domain of the protein A gene preceeded by the protein A promoter and signal sequence in expression vector pEZZ318 (16). The resulting fusion protein (ZZ-FR), which has a molecular weight of approximately 63 kD as determined by SDS-PAGE (FIG. 4), contains 126 amino acids of the ZZ domain followed by approximately 200 amino acids encoded by the 600 bp insert from the fnbp-gene The C-terminal end of the protein consists of amino acids which are the result of an out of frame read-through into the lacZ' gene of the vector until a translation stop codon is reached. The fusion protein (ZZ-FR), which is expressed at a high level and secreted to the growth medium of E. coli, is easily isolated by affinity chromatogtaphy making use of the IgG-binding ability of the ZZ-domain.

The ZZ-FR protein was bound to the 29 kD NH₂ -terminal domain of fibronectin and completely inhibited the bindning of intact fibronectin to S. aureus (FIGS. 5 and 6). These data indicate that under the incubation conditions used other proteins, recognizing domains outside the 29 kD N-terminus of fibronectin, are not expressed by the staphylococcal cells. Furthermore the fibronectin binding activity of the FNBP has been localized to a fairly small segment of the protein. Recent analysis of the native 210 kD FNBP isolated from S. aureus strain Newman demonstrated that this protein is multivalent and one molecule of the FNBP is capable of binding 6-9 fibronectin molecules (12). The cloned fnbp-gene is derived from strain S.aureus 8325-4. If there are no differences between strains of S. aureus one would except the FR-region to contain several repeating sequences. This question was also answered by sequence analysis of the cloned fnbp-gene. The sequence of the FR-region having FNBP-properties is given in FIG. 8.

There is reason to believe that the FR binding activity is related to the each of the three 38 amino acids repeats as the Bal1-PvuII-fragment encodes for a binding activity (cf. FIG. 3); as the Bal1-HincII-fragment encodes for a binding activity; and as the HincII-PvuII-fragment does not encode for binding activity. Furthermore, the one single 38 amino acids repeat is functional in binding fibronectin, since a synthezised 38-amino acids long peptide (38(2)-repeat) has binding ability, as shown in FIG. 9. Each of the three 38 amino acids repeats are very homologous.

EXAMPLE 2

Chemical synthesis of a polypeptide based on the nucleotide sequence coding for the fibronectin binding domain of 38 amino acids (38(2) repeat) was performed by building up the amino acid sequence corresponding to said nucleotide sequence starting from the C-terminal histidin and stepwise reacting with the appropriate amino acid and finally reacting with the glycine at the N-terminal end, in a solid phase synthesis according to the method by K. B. Merrifield, J. Am. Chem. Soc. 86, pp.304, (1964). Hereby the polypeptide corresponding to the second 38 amino acid repeat was synthezised, as well as three further polypeptides being parts of this 38-repeat, viz: 1) the polypeptide covering amino acids 1-19, 2) the polypeptide covering amino acids 9-30, and 3) the polypeptide covering amino acids 20-38, were all synthezised according to the same method.

The fibronectin binding ability of the complete 38-repeat, as well as of the 1-19 amino acid polypeptide, the 9-30 amino acid polypeptide, the 20-38 amino acid polypeptide, as well as a mixture of these three smaller polypeptides was tested, and the result is given in FIG. 9. The fragments of the complete 38-repeat were synthezised in order to check if fibronectin binding is dependent upon the complete 38-repeat, as anticipated, or if fibronectin binding properties could be further confined to some smaller regoin of the 38 amino acid sequence. As evident from FIG. 9 the fibronectin binding property is only present in the complete 38 amino acids peptide.

Materials and Methods

Bacterial strains and plasmids. A gene bank in pBR322 of chromosomal DNA from Staphylococcus aureus strain 8325-4, earlier described (13) was screened for clones expressing fibronectin binding activity. E. coli strains HB102, (18) and JM105 (19), were used in subcloning and expression experiments. The plasmid vectors used were pBR322 (20) pUC18 (21) and the protein A vectors pRIT3 (15) and pEZZ318 (16). S.aureus strains Cowan I, Newman and 8325-4 were used in the assay of the fibronectin binding protein (FNBP).

Microorganism growth medium. At the culture of E. coli bacteria the following medium was used. The amounts given relates to 1 liter of medium.

    ______________________________________                                         Trypton Soy Broth (Oxoid Ltd,                                                                          30     g                                               Basingstoke, Hants, GB)                                                        Yeast Extract (Oxoid)   10     g                                               D-glucose               40     g                                               NH.sub.4 Cl             2,5    g                                               Na.sub.2 HPO.sub.4.2H.sub.2 O                                                                          7,5    g                                               KH.sub.2 PO.sub.4       3,0    g                                               Na.sub.2 SO.sub.4.10H.sub.2 O                                                                          2,5    g                                               MgSO.sub.4.7H.sub.2 O   0,2    g                                               CaCl.sub.2.2H.sub.2 O   0,5    mg                                              FeCl.sub.3.6H.sub.2 O   16,7   mg                                              ZnSO.sub.4.7H.sub.2 O   0,18   mg                                              CuSO.sub.4.5H.sub.2 O   0,16   mg                                              MnSO.sub.4.4H.sub.2 O   0,15   mg                                              CoCl.sub.2              0,10   mg                                              NaEDTA                  20,1   mg                                              ______________________________________                                    

Assay of fibronectin binding protein (FNBP). Quantitation of fibronectin binding to cells of S. aureus has been described earlier (10). If not otherwise stated, 10⁹ cells of S. aureus Cowan I are incubated with ¹²⁵ I-labelled fibronectin or the 29 kD NH₂ -terminal fragment of fibronectin in PBS containing 1 mg/ml bovine serum albumin in a total volume of 0,3 ml. After incubation for 2 hours at 22° C. the radioactivity bound to the cells is measured in a gamma counter.

Lysates of E. coli clones prepared in Tris-HCl buffer, pH 8,1, containing lysozyme EDTA as earlier described (13), were analysed for fibronectin binding activity by measuring their ability to compete with staphylococcal cells for binding the ¹²⁵ I-labelled 29 kD NH₂ -terminal fragment of fibronectin. The amount of FNBP able to inhibit binding to 50% is considered as one unit of activity.

An osmotic shock procedure was used to release proteins from the periplasmic space of E. coli (14).

Purification of FNBP. The purification is based on affinity chromatography on fibronectin-Sepharose followed by ion-exchange chromatography.

Human fibronectin was prepared from outdated blood by the method known (22). The fibronectin was then dialysed against 20 mM Tris-HCl, pH 8,3 and concentrated on a DEAE-column. The coupling of fibronectin to Sepharose SL-4B was done by a bromocyan activation procedure as known (23).

The E. coli lysate was pumped onto the affinity column which was subsequently washed with 0,5M ammonium acetate until the baseline was stable (about four column volumes). The FNBP was then eluted with 0,4M acetic acid and either neutralized with ammonia or lyophilized. After dialysis against 10 mM ammonium acetate with pH adjusted to 7,6 with ammonia a further fractionation step was performed by ion-exchange chromatography on a Pharmacia FPLC equipment using a mono Q column.

Nucleotide sequence analysis. The nucleotide sequence was determined in accordance with the methods described by Maxam, A. M. et al (27).

Amino acid analysis. The amino acid composition was determined using a Durrum D-500 analyzer. Samples were hydrolyzed in 6M HCl containing 2 mg/ml phenol for 24 hours at 110° C. One sample was also oxidized with performic acid in oder to determine cystein and methionine. Norluecin was added as an internal standard. Protein was determined according to (24) using bovine serum albumin as a standard.

Electrophoresis. If not indicated otherwise SDS-polyacrylamide gel electroporesis was performed in 5-15% gradient gels. The gels were stained with Coomassie brilliant blue, de-stained and photographed.

Radioimmunoassay procedure. The purified FNBP with an estimated molecular weight in SDS-polyacrylamide gel electrophoresis of 165 kD was labelled with ¹²⁵ I-odine by the chloramine-T method as known (25). After the iodinaton the material was rechromatographed on a fibronectin-Sepharose column.

The incubation mixture contained ¹²⁵ I-FNBP (3 400 cpm in 10 μl) was mixed with various dilutions of a rabbit antiserum (containing antibodies directed against S. aureus strain Newman), in incubation buffer (PBS, 0,1% Triton X-100 and 0,02% sodium azid) in a volume of 0,2 ml.

Samples were incubated for 2 hours at 20° C. to allow antigen-antibody reaction. 0,1 ml of 10% suspension (w/v) of protein A-Sepharose in PBS was added and the mixture was incubated for another hour. The incubation was stopped by adding another 1,7 ml of incubation buffer to the samples. After centrifugation at 2000 rpm for 3 min the supernatants were sucked off. The pellets were washed twice in incubation buffer and the radioactivity associated with the protein A-Sepharose was measured in a gammacounter.

Restriction endonucleases and other enzymes. Restriction enzymes, T4 DNA ligase and Bal31 were purchased from BRL and used according to their recommendations. Other methods involving DNA techniques were essentially as known (26).

                  TABLE 1                                                          ______________________________________                                         Comparison of amino acid compositions of fibronectin binding                   proteins (87 and 165 kD) isolated from E. coli pFR001 and the                  native FNBP isolated from S. aureus strain Newman (210 kD).                                 Composition (mol %)                                               Amino acid     210 kD.sup.a)                                                                             165 kD  87 kD                                        ______________________________________                                         Aspartic acid/asparagine                                                                      15.4       14.6    13.4                                         Threonine      9.7        10.7    10.3                                         Serine         7.4        6.5     8.0                                          Glutamic acid/glutamine                                                                       17.9       17.1    15.1                                         Proline        6.3        6.2     5.8                                          Glycine        8.9        7.9     8.4                                          Alanine        5.3        4.6     4.7                                          Half-cystine   0.1        0.2     n.d.                                         Valine         7.2        7.8     8.6                                          Methionine     0.6        0.6     n.d.                                         Isoleucine     4.3        4.7     3.8                                          Leucine        4.1        4.0     4.6                                          Tyrosine       2.1        2.3     4.1                                          Phenylalanine  2.4        2.0     3.6                                          Histidine      1.0        3.2     3.0                                          Lysine         5.3        6.3     6.6                                          Tryptophan     n.d.       n.d.    n.d.                                         Arginine       1.9        1.2     --                                           ______________________________________                                          .sup.a) from Froman et al. (1987), (12).                                       n.d. = not determined                                                    

The presnt fibronectin binding protein can be used for immunization, whereby the protein, preferably in combination with a fusion protein to create a large antigen to respond to, is injected in dosages causing immunological reaction in the host mammal. Thus the fibronectin binding protein can be used in vaccination of ruminants against mastitis caused by Staphylococcal infections. The fibronectin binding protein of this invention has shown to form antibodies against a staphylococcal mastitis in a mouse model as shown in the Table below.

                  TABLE                                                            ______________________________________                                         Experimental mouse mastitis produced by S. aureus strain SA                    113(83A) in a dose of 1,0 × 10.sup.3 cfu. Evaluation of                  immunization                                                                   using 15 μg protein per mouse of the fibronectin binding                    protein expressed from E. coli containing plasmid pFR001                       as identified herein.                                                                No. of               No. of                                                    mam-                 mam-                                                Group mary    Gross examination                                                                           mary  Microscopic                                   of    glands  type of lesion (%)                                                                          glands                                                                               type of lesion (%)                            mice  inocul. +++    ++  +   0   invest.                                                                              A   B   C1  C2  C3                      ______________________________________                                         Vac-  30      3      8   83  10  11    9   0   0   91  0                       cinated                                                                        (FNBP)                                                                         Control                                                                              38      50     16  32  3   8     38  0   25  38  0                       ______________________________________                                          Mastitis: +++ = gross; ++ = mediumgrade; + = low grade; 0 = no macroscopi      changes;                                                                       A = consistently nonreactive, total necrosis; B = advanced regressive          changes + slight inflammatory reaction; C1 = disseminated inflammatory         reaction + local necrosis; C2 = disseminated inflammatory reaction; C3 =       local inflammatory reaction; 0 = no reaction.                            

As evident from the Table above a consistent immunization is obtained using the FNBP as expressed by the E. coli containing the plasmid pFR001.

Further, the fibronectin binding protein can be used to block an infection in an open skin wound by wound treatment using the fibronectin binding protein in a suspension. Thus the fibronectin binding protein can be used for the treatment of wounds, e.g. for blocking protein receptors, or for immunization (vaccination). In the latter case the host body produces specific antibodies, which can protect against invasion of bacterial strains comprising such a fibronectin binding protein. Hereby the antibodies block the adherence of the bacterial strains to damaged tissue.

Examples of colonizing of a tissue damage are:

a) colonizing of wounds in skin and connective tissue, which wounds have been caused by a mechanical trauma, chemical damage, and/or thermical damage;

b) colonizing of wounds on mucous membranes, such as in the mouth cavity, or in the mammary glands, urethra, or vagina;

c) colonizing on connective tissue proteins, which have been exposed by a minimal tissue damage (microlesion) in connection with epithelium and endothelium (mastitis, heart valve infection, hip exchange surgery).

When using the present FNBP, or the 38 amino acid polypeptide, for the purpose of immunization (vaccination) in mammals, including man, the protein, or polypeptide is dispersed in sterile, isotonic saline solution, optionally while adding a pharmaceutically acceptable dispersing agent. Different types of adjuvants can further be used in order to sustain the release in the tissue, and thus expose the protein or the peptide for a longer time to the immundefense system of a body.

A suitable dosage to obtain immunization is 0,5 to 5 μg of FNBP, or polypeptide, per kg bodyweight and injection of immunization. In order to obtain a durable immunization, vaccination should be carried out at more than one consecutive occasions with an interval of 1 to 3 weeks, preferably at three occasions.

When using the present FNBP, or polypeptide, for topical, local administration the protein is dispersed in an isotonic saline solution to a concentration of 25 to 250 μg per ml. The wounds are then treated with such an amount only to obtain a complete wetting of the wound surface. For an average wound thus only a couple of milliliters of solution are used in this way. After treatment using the protein solution the wounds are suitably washed with isotonic saline or another suitable wound treatment solution.

Further the fibronectin binding protein as well as the minimal fibronectin binding site polypeptide, of the present invention can be used to diagnose bacterial infections caused by Staphylococci strains, whereby a fibronectin binding protein of the present invention is immobilized on a solid carrier, such as small latex or Sepharose® beads, whereupon sera containing antibodies are allowed to pass and react with the FNBP thus immobilized. The agglutination is then measured by known methods.

Further, the FNBP, or the polypeptide can be used in an ELISA test (Enzyme Linked Immuno Sorbent Assay; E Engvall, Med. Biol. 55, 193, (1977)). Hereby wells in a polystyrene microtitre plate are coated with the FNBP, and incubated over night at 4° C. The plates are then thoroughly washed using PBS containing 0.05% TWEEN 20, and dried. Serial dilution of the patient serum were made in PBS-Tween, were added to the wells, and incubated at 30° C. for 1.5 hrs. After rinsing antihuman-IgG conjugated with an enzyme, or an antibovine-IgG conjugated with an enzyme, respectivel, horseradishperoxidase or an alkaline phosphatase, was added to the wells and incubated at 30° C. for 1,5 hrs, whereupon when the IgG has been bound thereto, and after rinsing, an enzyme substrate is added, a p-nitrophosphate in case of an alkaline phosphatase, or ortophenylene diamine substrate (OPD) in case a peroxidase has been used, respectively. The plates comprising the wells were thus then rinsed using a citrate buffer containing 0.055% OPD, and 0.005% H₂ O₂, and incubated at 30° C. for 10 min. Enzyme reaction was stopped by adding a 4N solution of H₂ SO₄ to each well. The colour development was measured using a spectrophotometer.

Depending on the type of enzyme substrate used a fluoroscense measurement can be used as well.

Another method to diagnose Staphylococci infections is by using the DNA gene probe method based on the FNBP sequence or the 38 amino acid polypeptide sequence. Thereby the natural or synthetic DNA sequences are attached to a solid carrier, such as a polystyrene plate as mentioned above, by e.g. adding a milk in the case of diagnozing a mastitis, to the surface. The DNA gene probe, optionally labelled enzymatically, or by a radioactive isotope is then added to the solid surface plate comprising the DNA sequence, whereby the DNA gene probe attaches to the sequence where appearing. The enzyme or the radioactive isotope can then readily be determined by known methods.

Above the term fibronectin binding protein includes any of the 38 amino acid polypeptide sequences as well, which 38 amino acid polypeptide sequences forms the minimal fibronectin binding site of the complete protein.

REFERENCES

8. Beachey, E. H. and Simpson, W. A (1982). Infection 10, 107-110.

20. Bolivar, F., Rodriquez, R. L., Greene, P. J., Betlach, M. C., Heyneker, H: L., Boyer, H. W., Crosa, J. H. and Falkow, S. (1977). Gene, 2, 95-113.

18. Boyer, H. W. and Roulland-Dussoix, D. (1969). J. Mol.Biol: 41, 459-472.

9. Courtney, H. S., Ofek, I., Simpson, W. A., Hasty, D. L. and Beachey, E. H. (1986). Infect. Immun. 53, 454-459.

11. Espersen, F. and Clemmensen, I. (1982). Infect. Immun. 37, 526-531.

17. Froman, G., Switalski, L. M., Guss, B., Lindberg, M., Hook, M. and Wadstrom, T. (1986) In Lark, D. L. ed. Protein-Carbohydrate Interactions in Biological Systems. Academic Press, London, pp. 263-268.

12. Froman, G., Switalski, L. M., Speziale, P. and Hook, M. (1987) in press. J. Biol. Chem.

25. Hunter, W. M. (1978) Radioimmunoassay. In: Wier, K. M. ed. Handbook of Experimental Immunology. London Blackwell. 14.1-14.40.

1. Hynes, R. O. (1985) Annu. Rev. Cell Biol. 1, 67-90.

2. Hynes, R. O. (1986) Sci. Ann. 254, 42-51.

6. Kuusela, P. (1978) Nature 276, 718-720.

24. Lowry, O. H., Rosebrough, N. J., Farr, A. L. and Randall, R. J., Biol. Chem. 193, 265-275.

13. Lofdahl, S., Guss B., Uhlen, M., Philipson, L. and Lindberg, M. (1983) Proc. Natl. Acad. Sci. USA 80, 697-701.

26. Maniatis, T., Fritsch, E. F. and Sambrok, J. (1982). Molecular cloning: a laboratory manual, Cold Spring Harbor Laboratory, New York.

23. March, S. C., Parikh, I. and Cuatrecasas, P. (1974). Analyt. Biochem. 60, 149-152.

27. Maxam, A. M. and Gilbert, W., (1977), Proc. Natl. Acad. Sci., USA, 74, 560.

28. Merrifield, K. B., J. Am. Chem. Soc., 86, pp.304, (1964)

19. Messing, J. and Carlsson, J. (1984). J. Biotechnol. 1, 253-264.

22. Miekka, S. I., Ingham, K. C. and Menache, D. (1982). Thromb Res. 27, 1-14.

15. Nilsson, B., Abrahamsen, L. and Uhlen, M. (1985). EMBO J. 4, 1075-1080.

16. Nilsson, B., Moks, T., Jansson, B., Abrahamsen, L., Elmblad, A., Holmgren, E., Henrichson, L., Jones, T. A. and Uhlen, M. (1986). Protein Engineering, In press.

21. Norrander, J., Kempe, T. and Messing, J. (1983). Gene, 26, 101-106.

14. Nossal, N. G. and Heppel, L. A. (1965). J. Biol. Chem. 241, 3055-3062.

3. Ruoslahti, E. and Pierschbacher, M. D. (1986). Cell, 44, 517-518.

10. Ryden, C., Rubin, K., Speziale, P., Hook, M., Lindberg, M. and Wadstrom, T. (1983), J. Biol. Chem. 258, 3396-3401.

7. Wadstrom, T., Switalski, L., Speziale, P., Rubin, K., Ryden, C., Froman, G., Faris, A., Lindberg, M., Hook, M., (1985), In Jackson, G. J. (ed), Pathogenesis of Infection. Springer Verlag, Berlin, Heidelberg, New York, Tokyo, pp. 193-207.

4. Woods, A., Couchman, J. R., Johansson, S., and Hook, M. (1986), EMBO J. 5, 665-670.

5. Yamada, K. M. (1983), Annu. Rev. Biochem. 52, 761-799.

LEGENDS TO THE FIGURES

FIG. 1A. Affinity chromatography on fibronectin-Sepharose

A lysate (86 ml) obtained by cold osmotic shock of the E. coli clone pFR001 was mixed with 29 ml of 2M ammonium acetate. The sample was then applied to a column (1.9×5.7 cm) equilibrated with 0.5M ammonium acetate at a flow rate of 50 ml/h. After the application of the sample the column was washed with four column volumes of 0.5M ammonium acetate at a flow rate of 20 ml/h. At the same time the sensitivity of the UV-monitor was increased by a factor of five. The material eluting between 200 and 220 ml was pooled, neutralized with ammonium hydroxide and dialysed against 10 mM ammonium acetate, pH 7.6.

FIG. 1B. Ion-exchange chromatography

A sample (10 ml) of the dialysed material from the affinity-chromatography described in FIG. 1A was applied to a 1 ml Mono Q (Pharmacia, Sweden) anion column equilibrated with 10 mM ammonium acetate, pH 7.6. The flow rate was 2 ml/min and the column was eluted by a liner increase in the concentration of ammonium acetate of 25 mM per ml. The two peaks (I and II) were pooled as indicated in the figure.

FIG. 2. Polyacrylamide gel electrophoresis of materials from the different purification steps of an osmotic shock lysate of E. coli pFR001

Material applied (lane 1) to the affinity column (fibronectin-Sepharose), unadsorbed (lane 2) and adsorbed (lane 3) material. Ion exchange chromatography of affinity purified material on a Mono Q FPLC column (Pharmacia, Sweden): pool I (lane 4) and pool II (lane 5) as marked in FIG. 1B.

FIG. 3. Restriction map, subclones and deletions of the insert in pFR001

(A) Restriction map of the 6.5 kb insert. (B) various subclones constructed in order to determine the region of the gene which codes for the fibronectin binding activity. (C) Deletions made from either the 3' or the 5'end of the EcoRI-PstI-fragment by treatment with exonuclease Bal31 (see text for details). The fibronectin binding activity for the differnt gene products is indicated.

FIG. 4. Polyacrylamide gel electrophoresis of the ZZ-FR protein

Protein A (Sigma Chemical Co, St. Louis; lane A) and the ZZ-FR fusion protein (lane B) were reduced and subjected to electrophoresis in SDS on a 5-15% polyacrylamide gel. The gel was subsequently stained with Coomassie blue. In lanes C & D the ZZ-FR fusion protein and protein A, respectively, were fractionated by SDS-electrophoresis on a 5-9% polyacrylamide gel, electroblotted to nitrocellulose paper and probed with ¹²⁵ I-labelled 29 kD fibronectin fragment as described (Froman et al, 1987). Arrows indicate the migration distance of standard proteins of known molecular weights.

FIG. 5. Affinity chromatography of the ZZ-FR protein

The ZZ-FR fusion protein (0.6 mg; panel A) and protein A (Sigma Chemical Co; 0,5 mg; panel B) were applied to a 7 ml column of Sepharose 4B CL substituted with 29 kD fibronectin fragment. The column was washed with 0.5M NaCl in PBS and subsequently eluted with 6M GuHCl in PBS. Fractions of 2 ml were collected and assayed for protein using the BioRad system.

FIG. 6. Polyacrylamide gel electrophoresis of the ZZ-FR fusion protein preparation, fractionated by affinity chromatography.

Material not bound (lane A) and bound and eluted (lane B), respectively, from the 29 kD affinity column was analyzed by SDS-gel electrophoresis on 5-15% polyacrylamide gels. The gel was subsequently stained with Coomassie blue. Arrows indicate the migration distance of standard proteins of known molecular weights.

FIG. 7. Inhibition of binding ¹²⁵ I-fibronectin to bacterial cells.

Staphylococcal cells (5×10⁷) of strain Newman (panel A) or strain 8325-4 (panel B) were incubated with 5×10⁴ cpm of ¹²⁵ I-labelled intact fibronectin (◯--◯) or 29 kD N-terminal fibronectin fragment (Δ--Δ) in PBS supplemented with 0,1% BSA, and 0.1% Tween® 80 in a total volume of 0.5 ml for 1 hr in the presence of increasing amounts of ZZ-FR fusion protein or protein A (--). For further details of the binding assay used, see Froman et al (1987). The amounts of ¹²⁵ I-radioactivity associated with bacterial cells was quantitated and the extent of fibronectin binding calculated. One hundred percent binding represent ¹²⁵ I-ligand bound to bacteria in absence of inhibiting protein and 0% binding corresponded to radioactivity recorded from incubation mixtures without bacteria.

FIG. 8A SEQ ID NO.: 6!. Sequence of nucleotide coding for fibronectin binding protein.

The nucleotide sequence coding for the fibronectin binding protein together with its corresponding amino acids is given. The different 38-amino acids repeats are marked, as well as appearing following almost four complete 14 amino acids repeats. The restriction sites BalI-HincII-PvuII have been noted in the figure, as well as have the amino acids sequences corresponding to the different nucleotide sequences.

FIG. 9 The fibronectin binding ability of a chemically synthezised polypeptide.

The fibronectin binding ability of the chemically synthezised 38 amino acids polypeptide according to Example 2 together with the fibronectin binding ability of the fragments of said polypeptide have been tested. (★--★) denotes the polypeptide corresponding to the 38(2) repeat of the amino acids sequence; (--) denotes the polypeptide corresponding to amino acids 1-19 of the amino acids sequence; (◯--◯) denotes the polypeptide corresponding to amino acids 20-38 of the amino acids sequence; (Δ--Δ) denotes the polypeptide corresponding to amino acids no. 9-30 of the amino acids sequence; and (- - - ) the polypeptide mixture comprising the polypeptides of amino acids 1-19, amino acids 20-38, amino acids 9-30. The binding ability in percent has been plotted against μg of polypeptide added.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 6                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 558 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        GGCCAAAATAGCGGTAACCAGTCATTCGAGGAAGACACAGAAGAAGACAAACCTAAATAT60                 GAACAAGGTGGCAATATCGTAGATATCGATTTTGATAGTGTACCTCAAATTCATGGTCAA120                AATAAAGGTAATCAGTCATTCGAGGAAGATACAGAAAAAGACAAACCTAAGTATGAACAT180                GGCGGTAACATCATTGATATCGACTTCGACAGTGTGCCACATATTCACGGATTCAATAAG240                CACACTGAAATTATTGAAGAAGATACAAATAAAGATAAACCAAGTTATCAATTCGGTGGA300                CACAATAGTGTTGACTTTGAAGAAGATACACTTCCAAAAGTAAGCGGCCAAAATGAAGGT360                CAACAAAGCATTGAAGAAGATACAACACCTCCAATCGTGCCACCAACGCCACCGACACCA420                GAAGTACCAAGTGAGCCGGAAACACCAACGCCACCAACACCAGAAGTACCAAGTGAGCCG480                GAAACACCAACACCACCGACACCAGAAGTGCCGAGTGAGCCAGAAACTCCAACACCGCCA540                ACACCAGAGGTACCAGCT558                                                          (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 342 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        GGCCAAAATAGCGGTAACCAGTCATTCGAGGAAGACACAGAAGAAGACAAACCTAAATAT60                 GAACAAGGTGGCAATATCGTAGATATCGATTTTGATAGTGTACCTCAAATTCATGGTCAA120                AATAAAGGTAATCAGTCATTCGAGGAAGATACAGAAAAAGACAAACCTAAGTATGAACAT180                GGCGGTAACATCATTGATATCGACTTCGACAGTGTGCCACATATTCACGGATTCAATAAG240                CACACTGAAATTATTGAAGAAGATACAAATAAAGATAAACCAAGTTATCAATTCGGTGGA300                CACAATAGTGTTGACTTTGAAGAAGATACACTTCCAAAAGTA342                                  (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 38 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        GlyGlnAsnSerGlyAsnGlnSerPheGluGluAspThrGluGluAsp                               151015                                                                         LysProLysTyrGluGlnGlyGlyAsnIleValAspIleAspPheAsp                               202530                                                                         SerValProGlnIleHis                                                             35                                                                             (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 38 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        GlyGlnAsnLysGlyAsnGlnSerPheGluGluAspThrGluLysAsp                               151015                                                                         LysProLysTyrGluHisGlyGlyAsnIleIleAspIleAspPheAsp                               202530                                                                         SerValProHisIleHis                                                             35                                                                             (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 38 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        GlyPheAsnLysHisThrGluIleIleGluGluAspThrAsnLysAsp                               151015                                                                         LysProSerTyrGlnPheGlyGlyHisAsnSerValAspPheGluGlu                               202530                                                                         AspThrLeuProLysVal                                                             35                                                                             (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 3280 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 57..3114                                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        TGCATTTATTAAGTTTAAAAAATTAATGAATTTTGCATTTAAAGGGAGATATTATA56                     GTGAAAAACAATCTTAGGTACGGCATTAGAAAACATAAATTGGGAGCA104                            ValLysAsnAsnLeuArgTyrGlyIleArgLysHisLysLeuGlyAla                               151015                                                                         GCATCAGTATTCTTAGGAACAATGATCGTTGTTGGGATGGGACAAGAC152                            AlaSerValPheLeuGlyThrMetIleValValGlyMetGlyGlnAsp                               202530                                                                         AAAGAAGCTGCAGCATCAGAACAAAAGACAACTACAGTAGAAGAAAAT200                            LysGluAlaAlaAlaSerGluGlnLysThrThrThrValGluGluAsn                               354045                                                                         GGGAATTCAGCTACTGATAATAAAACAAGTGAAACACAAACAACTGCA248                            GlyAsnSerAlaThrAspAsnLysThrSerGluThrGlnThrThrAla                               505560                                                                         ACTAACGTTAATCATATAGAAGAAACTCGATCATATAACGCAACAGTA296                            ThrAsnValAsnHisIleGluGluThrArgSerTyrAsnAlaThrVal                               65707580                                                                       ACAGAACAACCGTCAAACGCAACACAAGTAACAACTGAAGAAGCACCA344                            ThrGluGlnProSerAsnAlaThrGlnValThrThrGluGluAlaPro                               859095                                                                         AAAGCAGTACAAGCACCACAAACTGCACAACCAGCAAATATAGAAACA392                            LysAlaValGlnAlaProGlnThrAlaGlnProAlaAsnIleGluThr                               100105110                                                                      GTTAAAGAAGAGGTAGTTAAGGAAGAAGCGAAACCTCGAGTTAAGGAA440                            ValLysGluGluValValLysGluGluAlaLysProArgValLysGlu                               115120125                                                                      ACAACACAATCTCAAGACAATAGCGGAGATCAAAGACAAGTAGATTTA488                            ThrThrGlnSerGlnAspAsnSerGlyAspGlnArgGlnValAspLeu                               130135140                                                                      ACACCTAAAAAGGCTACACAAAATCAAGTCGCAGAAACACAAGTTGAA536                            ThrProLysLysAlaThrGlnAsnGlnValAlaGluThrGlnValGlu                               145150155160                                                                   GTGGCACAGCCAAGAACGGCATCAGAAAGTAAGCCAGCTGTGACAAGA584                            ValAlaGlnProArgThrAlaSerGluSerLysProAlaValThrArg                               165170175                                                                      TCAGCAGATGTAGCGGAAGCTAAGGAAGCTAGTAACGCGAAAGTCGAA632                            SerAlaAspValAlaGluAlaLysGluAlaSerAsnAlaLysValGlu                               180185190                                                                      ACGGGTACAGATGTAACAAGTAAAGTTACAGTAGAAATTGGTTCTATT680                            ThrGlyThrAspValThrSerLysValThrValGluIleGlySerIle                               195200205                                                                      GAGGGGCATAACAATACAAATAAAGTAGAACCTCATGCAGGACAACGA728                            GluGlyHisAsnAsnThrAsnLysValGluProHisAlaGlyGlnArg                               210215220                                                                      GCGGTACTAAAATATAAGTTGAAATTTGAGAATGGTTTACATCAAGGT776                            AlaValLeuLysTyrLysLeuLysPheGluAsnGlyLeuHisGlnGly                               225230235240                                                                   GACTACTTTGACTTTACTTTATCAAATAATGTAAATACGCATGGCGTA824                            AspTyrPheAspPheThrLeuSerAsnAsnValAsnThrHisGlyVal                               245250255                                                                      TCAACTGCTAGAAAAGTACCAGAAATTAAAAATGGTTCAGTCGTAATG872                            SerThrAlaArgLysValProGluIleLysAsnGlySerValValMet                               260265270                                                                      GCGACAGGTGAAGTTTTAGAAGGTGGAAAGATTAGATATACATTTACA920                            AlaThrGlyGluValLeuGluGlyGlyLysIleArgTyrThrPheThr                               275280285                                                                      AATGATATTCAAGATAAGGTTGATGTAACGGCTGAACTAGAAATTAAT968                            AsnAspIleGlnAspLysValAspValThrAlaGluLeuGluIleAsn                               290295300                                                                      TTATTTATTGATCCTAAAACTGTACAAACTAATGGAAATCAAACTATA1016                           LeuPheIleAspProLysThrValGlnThrAsnGlyAsnGlnThrIle                               305310315320                                                                   ACTTCAACACTAAATGAAGAACAAACTTCAAAGGAATTAGATGTTAAA1064                           ThrSerThrLeuAsnGluGluGlnThrSerLysGluLeuAspValLys                               325330335                                                                      TATAAAGATGGTATTGGGAATTATTATGCCAATTTAAATGGATCGATT1112                           TyrLysAspGlyIleGlyAsnTyrTyrAlaAsnLeuAsnGlySerIle                               340345350                                                                      GAGACATTTAATAAAGCGAATAATAGATTTTCGCATGTTGCATTTATT1160                           GluThrPheAsnLysAlaAsnAsnArgPheSerHisValAlaPheIle                               355360365                                                                      AAACCTAATAATGGTAAAACGACAAGTGTGACTGTTACTGGAACTTTA1208                           LysProAsnAsnGlyLysThrThrSerValThrValThrGlyThrLeu                               370375380                                                                      ATGAAAGGTAGTAATCAGAATGGAAATCAACCAAAAGTTAGGATATTT1256                           MetLysGlySerAsnGlnAsnGlyAsnGlnProLysValArgIlePhe                               385390395400                                                                   GAATACTTGGGTAATAATGAAGACATAGCGAAGAGTGTATATGCAAAT1304                           GluTyrLeuGlyAsnAsnGluAspIleAlaLysSerValTyrAlaAsn                               405410415                                                                      ACGACAGATACTTCTAAATTTAAAGAAGTCACAAGTAATATGAGTGGG1352                           ThrThrAspThrSerLysPheLysGluValThrSerAsnMetSerGly                               420425430                                                                      AATTTGAATTTACAAAATAATGGAAGCTATTCATTGAATATAGAAAAT1400                           AsnLeuAsnLeuGlnAsnAsnGlySerTyrSerLeuAsnIleGluAsn                               435440445                                                                      CTAGATAAAACTTATGTTGTTCACTATGATGGAGAGTATTTAAATGGT1448                           LeuAspLysThrTyrValValHisTyrAspGlyGluTyrLeuAsnGly                               450455460                                                                      ACTGATGAAGTTGATTTTAGAACACAAATGGTAGGACATCCAGAGCAA1496                           ThrAspGluValAspPheArgThrGlnMetValGlyHisProGluGln                               465470475480                                                                   CTTTATAAGTATTATTATGATAGAGGATATACCTTAACTTGGGATAAT1544                           LeuTyrLysTyrTyrTyrAspArgGlyTyrThrLeuThrTrpAspAsn                               485490495                                                                      GGTTTAGTTTTATACAGTAATAAAGCGAACGGAAATGAGAAAAATGGT1592                           GlyLeuValLeuTyrSerAsnLysAlaAsnGlyAsnGluLysAsnGly                               500505510                                                                      CCGATTATTCAAAATAATAAATTTGAATATAAAGAAGATACAATTAAA1640                           ProIleIleGlnAsnAsnLysPheGluTyrLysGluAspThrIleLys                               515520525                                                                      GAAACTCTTACAGGTCAATATGATAAGAATTTAGTAACTACTGTTGAA1688                           GluThrLeuThrGlyGlnTyrAspLysAsnLeuValThrThrValGlu                               530535540                                                                      GAGGAATATGATTCATCAACTCTTGACATTGATTACCACACAGCTATA1736                           GluGluTyrAspSerSerThrLeuAspIleAspTyrHisThrAlaIle                               545550555560                                                                   GATGGTGGAGGTGGATATGTTGATGGATACATTGAAACAATAGAAGAA1784                           AspGlyGlyGlyGlyTyrValAspGlyTyrIleGluThrIleGluGlu                               565570575                                                                      ACGGATTCATCAGCTATTGATATCGATTACCATACTGCTGTGGATAGC1832                           ThrAspSerSerAlaIleAspIleAspTyrHisThrAlaValAspSer                               580585590                                                                      GAAGCAGGTCACGTTGGAGGATACACTGAGTCCTCTGAGGAATCAAAT1880                           GluAlaGlyHisValGlyGlyTyrThrGluSerSerGluGluSerAsn                               595600605                                                                      CCAATTGACTTTGAAGAATCTACACATGAAAATTCAAAACATCACGCT1928                           ProIleAspPheGluGluSerThrHisGluAsnSerLysHisHisAla                               610615620                                                                      GATGTTGTTGAATATGAAGAAGATACAAACCCAGGTGGTGGTCAGGTT1976                           AspValValGluTyrGluGluAspThrAsnProGlyGlyGlyGlnVal                               625630635640                                                                   ACTACTGAGTCTAATTTAGTTGAATTTGACGAAGAGTCTACAAAAGGT2024                           ThrThrGluSerAsnLeuValGluPheAspGluGluSerThrLysGly                               645650655                                                                      ATTGTAACTGGCGCAGTGAGCGATCATACAACAGTTGAAGATACGAAA2072                           IleValThrGlyAlaValSerAspHisThrThrValGluAspThrLys                               660665670                                                                      GAATATACAACTGAAAGTAATCTGATTGAATTAGTGGATGAATTACCT2120                           GluTyrThrThrGluSerAsnLeuIleGluLeuValAspGluLeuPro                               675680685                                                                      GAAGAGCATGGTCAAGCACAAGGACCAGTCGAGGAAATTACTAAAAAC2168                           GluGluHisGlyGlnAlaGlnGlyProValGluGluIleThrLysAsn                               690695700                                                                      AATCATCATATTTCTCATTCTGGTTTAGGAACTGAAAATGGTCACGGG2216                           AsnHisHisIleSerHisSerGlyLeuGlyThrGluAsnGlyHisGly                               705710715720                                                                   AATTATGACGTGATTGAAGAAATCGAAGAAAATAGCCACGTTGATATT2264                           AsnTyrAspValIleGluGluIleGluGluAsnSerHisValAspIle                               725730735                                                                      AAGAGTGAATTAGGTTATGAAGGTGGCCAAAATAGCGGTAACCAGTCA2312                           LysSerGluLeuGlyTyrGluGlyGlyGlnAsnSerGlyAsnGlnSer                               740745750                                                                      TTCGAGGAAGACACAGAAGAAGACAAACCTAAATATGAACAAGGTGGC2360                           PheGluGluAspThrGluGluAspLysProLysTyrGluGlnGlyGly                               755760765                                                                      AATATCGTAGATATCGATTTTGATAGTGTACCTCAAATTCATGGTCAA2408                           AsnIleValAspIleAspPheAspSerValProGlnIleHisGlyGln                               770775780                                                                      AATAAAGGTAATCAGTCATTCGAGGAAGATACAGAAAAAGACAAACCT2456                           AsnLysGlyAsnGlnSerPheGluGluAspThrGluLysAspLysPro                               785790795800                                                                   AAGTATGAACATGGCGGTAACATCATTGATATCGACTTCGACAGTGTG2504                           LysTyrGluHisGlyGlyAsnIleIleAspIleAspPheAspSerVal                               805810815                                                                      CCACATATTCACGGATTCAATAAGCACACTGAAATTATTGAAGAAGAT2552                           ProHisIleHisGlyPheAsnLysHisThrGluIleIleGluGluAsp                               820825830                                                                      ACAAATAAAGATAAACCAAGTTATCAATTCGGTGGACACAATAGTGTT2600                           ThrAsnLysAspLysProSerTyrGlnPheGlyGlyHisAsnSerVal                               835840845                                                                      GACTTTGAAGAAGATACACTTCCAAAAGTAAGCGGCCAAAATGAAGGT2648                           AspPheGluGluAspThrLeuProLysValSerGlyGlnAsnGluGly                               850855860                                                                      CAACAAACGATTGAAGAAGATACAACACCTCCAATCGTGCCACCAACG2696                           GlnGlnThrIleGluGluAspThrThrProProIleValProProThr                               865870875880                                                                   CCACCGACACCAGAAGTACCAAGTGAGCCGGAAACACCAACGCCACCA2744                           ProProThrProGluValProSerGluProGluThrProThrProPro                               885890895                                                                      ACACCAGAAGTACCAAGTGAGCCGGAAACACCAACACCACCGACACCA2792                           ThrProGluValProSerGluProGluThrProThrProProThrPro                               900905910                                                                      GAAGTGCCGAGTGAGCCAGAAACTCCAACACCGCCAACACCAGAGGTA2840                           GluValProSerGluProGluThrProThrProProThrProGluVal                               915920925                                                                      CCAGCTGAACCTGGTAAACCAGTACCACCTGCCAAAGAAGAACCTAAA2888                           ProAlaGluProGlyLysProValProProAlaLysGluGluProLys                               930935940                                                                      AAGCCTTCTAAACCAGTGGAACAAGGTAAAGTAGTAACACCTGTTATT2936                           LysProSerLysProValGluGlnGlyLysValValThrProValIle                               945950955960                                                                   GAAATCAATGAAAAGGTTAAAGCAGTGGCACCAACTAAAAAACCACAA2984                           GluIleAsnGluLysValLysAlaValAlaProThrLysLysProGln                               965970975                                                                      TCTAAGAAATCTGAACTACCTGAAACAGGTGGAGAAGAATCAACAAAC3032                           SerLysLysSerGluLeuProGluThrGlyGlyGluGluSerThrAsn                               980985990                                                                      AAAGGTATGTTGTTCGGCGGATTATTCAGCATTCTAGGTTTAGCATTA3080                           LysGlyMetLeuPheGlyGlyLeuPheSerIleLeuGlyLeuAlaLeu                               99510001005                                                                    TTACGCAGAAATAAAAAGAATCACAAAGCATAATAAACAAAAAT3124                               LeuArgArgAsnLysLysAsnHisLysAla*                                                10101015                                                                       TGACGGGTTTATTTCATAAATTATATGAAGTAAGCCTGTTTTTTTATATTAAATCAAATT3184               TCTAATAGAAAATTAGAGTGTTTTCTGATTGCTTCATTGGTTTATGTCTGATGATTGATA3244               ACGAACTGAGAGATAAAGTTAGAATTTTAAAGTAGT3280                                       __________________________________________________________________________ 

We claim:
 1. An isolated fibronectin binding protein or polypeptide comprising at least one of the amino acid sequencesGly Gln Asn Ser Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Glu Asp Lys Pro Lys Tyr Glu Gln Gly Gly Asn Ile Val Asp Ile Asp Phe Asp Ser Val Pro Gln Ile His (SEQ ID NO: 3) Gly Gln Asn Lys Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Lys Asp Lys Pro Lys Tyr Glu His Gly Gly Asn Ile Ile Asp Ile Asp Phe Asp Ser Val Pro His Ile His (SEQ ID NO: 4) Gly Phe Asn Lys His Thr Glu Ile Ile Glu Glu Asp Thr Asn Lys Asp Lys Pro Ser Tyr Gln Phe Gly Gly His Asn Ser Val Asp Phe Glu Glu Asp Thr Leu Pro Lys Val (SEQ ID NO: 5).2.
 2. The protein according to claim 1 comprising:Gly Gln Asn Ser Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Glu Asp Lys Pro Lys Tyr Glu Gln Gly Gly Asn Ile Val Asp Ile Asp Phe Asp Ser Val Pro Gln Ile His (SEQ ID NO: 3)and Gly Gln Asn Lys Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Lys Asp Lys Pro Lys Tyr Glu His Gly Gly Asn Ile Ile Asp Ile Asp Phe Asp Ser Val Pro His Ile His (SEQ ID NO: 4).3.
 3. The protein according to claim 1 comprising:Gly Gln Asn Ser Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Glu Asp Lys Pro Lys Tyr Glu Gln Gly Gly Asn Ile Val Asp Ile Asp Phe Asp Ser Val Pro Gln Ile His (SEQ ID NO: 3)and Gly Phe Asn Lys His Thr Glu Ile Ile Glu Glu Asp Thr Asn Lys Asp Lys Pro Ser Tyr Gln Phe Gly Gly His Asn Ser Val Asp Phe Glu Glu Asp Thr Leu Pro Lys Val (SEQ ID NO: 5).4.
 4. The protein according to claim 1 comprising:Gly Gln Asn Lys Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Lys Asp Lys Pro Lys Tyr Glu His Gly Gly Asn Ile Ile Asp Ile Asp Phe Asp Ser Val Pro His Ile His (SEQ ID NO: 4)and Gly Phe Asn Lys His Thr Glu Ile Ile Glu Glu Asp Thr Asn Lys Asp Lys Pro Ser Tyr Gln Phe Gly Gly His Asn Ser Val Asp Phe Glu Glu Asp Thr Leu Pro Lys Val (SEQ ID NO: 5).5.
 5. The protein according to claim 1 comprising:Gly Gln Asn Ser Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Glu Asp Lys Pro Lys Tyr Glu Gln Gly Gly Asn Ile Val Asp Ile Asp Phe Asp Ser Val Pro Gln Ile His (SEQ ID NO: 3) Gly Gln Asn Lys Gly Asn Gln Ser Phe Glu Glu Asp Thr Glu Lys Asp Lys Pro Lys Tyr Glu His Gly Gly Asn Ile Ile Asp Ile Asp Phe Asp Ser Val Pro His Ile His (SEQ ID NO: 4)and Gly Phe Asn Lys His Thr Glu Ile Ile Glu Glu Asp Thr Asn Lys Asp Lys Pro Ser Tyr Gln Phe Gly Gly His Asn Ser Val Asp Phe Glu Glu Asp Thr Leu Pro Lys Val (SEQ ID NO: 5). 